Samuel Kaski and Jaakko Peltonen

نویسندگان

Samuel Kaski

Jaakko Peltonen

چکیده

D imensionality reduction is one of the basic operations in the toolbox of data analysts and designers of ma chine learning and pattern recognition systems. Given a large set of measured variables but few observations, an obvious idea is to reduce the degrees of freedom in the measurements by representing them with a smaller set of more “condensed” variables. Another reason for reducing the dimensionality is to reduce computational load in further processing. A third reason is visualization. “Looking at the data” is a central ingredient of exploratory data analysis, the first stage of data analysis where the goal is to make sense of the data before proceeding with more goal-directed modeling and analyses. It has turned out that although these different tasks seem alike, their solution requires different tools. In this article, we show that dimensionality reduction for data visualization can be represented as an information retrieval task, where the quality of visualization can be measured by precision and recall measures and their smoothed extensions. Furthermore, we show that visualization can be optimized to directly maximize the quality for any desired tradeoff between precision and recall, yielding very well-performing visualization methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2011

Samuel Kaski and Jaakko Peltonen

نویسندگان

چکیده

منابع مشابه

Learning Metri s for Self-Organizing Maps

Dependency exploration and learning metrics

Targeted learning by imposing asymmetric sparsity

Modeling of relevance

Chapter 11 Learning metrics

Chapter 5 Bioinformatics

عنوان ژورنال:

اشتراک گذاری